Buffer Compliance Control of Space Robots Capturing a Non-Cooperative Spacecraft Based on Reinforcement Learning

نویسندگان

چکیده

Aiming at addressing the problem that joints are easily destroyed by impact torque during process of space robot on-orbit capturing a non-cooperative spacecraft, reinforcement learning control algorithm combined with compliant mechanism is proposed to achieve buffer compliance control. The can not only absorb energy through deformation its internal spring, but also limit safe range combining strategy. First all, dynamic models and target spacecraft before capture obtained using Lagrange approach Newton-Euler method. After that, based on law conservation momentum, constraints kinematics velocity, integrated model post-capture hybrid system derived. Considering unstable system, for stable associative search network employed approximate unknown nonlinear functions, an adaptive critic utilized construct signal tune network. numerical simulation shows scheme reduce acting 76.6% maximum 58.7% minimum in operation phase. And phase, were limited within safety threshold, which avoid overload damage joint actuators.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Non-Cooperative Interaction via Reinforcement Learning and Its Control∗

A general setting is considered in which an arbitrary set of autonomous users interact by means of a controlled Markov process. This process is driven by the collective actions of users, and users receive separate payoffs according to its state. The only other assumption on the process is that it has a finite state space. Dynamic behavior of user strategies is characterized in the case when eac...

متن کامل

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

متن کامل

Cooperative control of modular space robots

Modular self-assembling on-orbit robots have the potential to reduce mission costs, increase reliability, and permit on-orbit repair and refueling. Modules with a variety of specialized capabilities would self-assemble from orbiting inventories. The assembled modules would then share resources such as power and sensors. As each free-flying module carries its own attitude control actuators, the ...

متن کامل

Task-space Control of Electrically Driven Robots

Actuators of robot operate in the joint-space while the end-effect or of robot is controlled in the task-space. Therefore, designing a control system for a robotic system in the task-space requires the jacobian matrix information for transforming joint-space to task-space, which suffers from uncertainties. This paper deals with the robust task-space control of electrically driven robot manipula...

متن کامل

Cooperative Behavior Acquisition in Multi Mobile Robots Environment by Reinforcement Learning Based on State Vector Estimation

This paper proposes a method that acquires the purposive behaviors based on the estimation of the state vectors. In order to acquire the cooperative behaviors in multi robot environments, each learning robot estimates local predictive model between the learner and the other objects separately. Based on the local predictive models, robots learn the desired behaviors using reinforcement learning....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2021

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app11135783